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THE IMBEDDING PROBLEM FOR RIEMANNIAN MANIFOLDS 


By JoHn NasH 
(Received October 29, 1954) 
(Revised August 20, 1955) 


Introduction and remarks 


History. The abstract concept of a Riemannian manifold is the result of an 
evolution in mathematical attitudes [1, 2]. In an earlier period mathematicians 
thought more concretely of surfaces in 3-space, of algebraic varieties, and of the 
Lobatchevsky manifolds. As the more abstract view of manifolds came into 
favor a question naturally arose: To what extent are the abstract Riemannian 
manifolds a more general family than the sub-manifolds of euclidean spaces? 

This question has been considered in various specializations and with assorted 
side conditions. In 1873 Schlaefli [3] discussed the local form of this imbedding 
problem. He conjectured that a neighborhood in an n-manifold would generally 
require an imbedding space of (n/2) (n + 1) dimensions. In 1901 Hilbert [4] 
obtained a negative result, showing that the Lobatchevsky plane is not realizable 
as a smooth surface in E?. Some contemporary negative theorems are due to 
Tompkins [5] and to Chern and Kuiper [6]. For example, a flat n-torus is not 
realizable in less than 2n dimensions. 

Janet [7] solved the local problem for two-manifolds with analytic metric in 
1926, and Cartan [8] immediately extended the result to n-manifolds, treating it 
as an application of his theory of Pfaffian forms. The dimensionality requirement 
was (n/2) (n + 1), as conjectured by Schlaefli. This number is a plausible one, 
being the number of components of the metric tensor. The proof depended on 
power series development, so it was limited to local results and it required that 
the metric be analytic. 

There are some theorems on the existence of isometric imbeddings in infinite 
dimensional spaces. This is a much simpler problem. 

A recent discovery [9, 10] is that C’ isometric imbeddings of Riemannian 
manifolds can be obtained in rather low dimensional spaces. At first glance some 
of these C’ results seem inconsistent with the negative theorems, such as Hilbert’s. 
Apparently C’ imbeddings are very different from the smoother ones. 

Until recently the only general results on imbeddings in the large were proved 
for the problem of Weyl. This problem is to realize in E* all two-manifolds with 
everywhere positive Gaussian curvature. Alexandrov [13] and Pogorelov [14] 
have been successful with a geometrical approach based on polyhedral approxi- 
mations. H. Lewy [12] and L. Nirenberg [15] have treated the problem from the 
viewpoint of partial differential equations. These results can probably be sharp- 
ened with respect to differentiability, but dimension-wise they are clearly 
optimal. 7 

Rigidity theory concerns the metric preserving perturbations of an imbedding. 

20 
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A closed convex surface in EF’ is rigid, because it admits only trivial perturbations. 
But it becomes flexible if there is a hole in it. Apparently rigidity disappears 
completely when the imbedding space has enough dimensions. 

Arrangement of this paper. There are four main divisions, called Parts A, B, C, 
and D. At the end of Part C the treatment of compact manifolds is complete 
and we state Theorem 2, which is essentially this: Every compact Riemannian 
n-manifold is realizable as a sub-manifold of Euclidean (n/2). (8n + 11)-space. 
In Part D this is made to apply to non-compact manifolds by means of a device 
which reduces the non-compact problem to the compact case. The device is 
extravagant with dimensions. Theorem 3 realizes non-compact n-manifolds in 
(n/2)(n + 1)(8n + 11) dimensions. 

The core of this paper is in Part B. There a perturbation process is developed 
and applied to construct a small finite perturbation of an imbedding such that 
the perturbed imbedding induces a metric that differs by a specified small amount 
from the metric induced by the original imbedding. This work is summarized at 
the end of Part B in Theorem 1. The interesting thing about the perturbation 
process is that it does not seem special to this imbedding problem. It may be an 
illustration of a general method applicable to a variety of problems involving 
partial differential equations. 

Part A is devoted to the fairly straight- fara construction of a smoothing 
operator of the type required by the method of Part B. The operator’s main 
properties are stated in equations (A15, 16, 17). In general, the four Parts are 
relatively independent in notation. Each depends only on the main results of the 
preceding part, not on the details. 

Remarks. Some respects in which the results here should be improvable are 
these: The dimension bounds for the imbedding space should be lowered; the 
C” case should be included; and it should be proved that the process gives an 
analytic imbedding when the metric is analytic. The treatment of the C’ and 
analytic cases would require new sets of estimates. A more unified approach to 
the problem which would not require the use of two separate sets of imbedding 
functions might reduce the dimension requirements substantially. 

The methods used here may prove more fruitful than the results. Time will 
tell how much can be done with smoothing and ‘‘feed-back”’ methods like those 
applied in Part B. The device of Part D suggests an alternative way to imbed 
general two-manifolds by exploiting the results on Weyl’s problem. 

Acknowledgement. I am profoundly indebted to H. Federer, to whom may be 
traced most of the improvement over the first chaotic formulation of this work. 
N. Levinson also gave very helpful advice and information; and through con- 
structive criticism, several others at M.I.T. helped me to improve the paper. 
This paper was supported in part by the Office of Naval Research. 


Part A: A GENERAL SMOOTHING OPERATOR 


This part develops an analytical tool, the smoothing operator, which is es- 
sential to the perturbation process developed in Part B. A smoothing operator is 
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first constructed to act on real functions of n-variables, that is functions on EK”. 
Then we define a smoothing operator for a manifold IR by imbedding I in EH” 
and extending scalar functions on Jt to functions on H”, which can be smoothed 
by the £” operator. Finally, we devise a canonical representation for tensors on 
Mt in terms of sets of scalars and use this to smooth tensors. We also obtain three 
important general inequalities that describe the action of smoothing on functions. 


Smoothing functions on E” 


Real functions of n-variables are smoothed by convolution with a certain 
kernel, Ks , that we define below. Here @ is a parameter controlling the degree of 
smoothing. Ky is defined by defining its Fourier transform Kg . 

Let ¥(u) be a C” function such that 


foru Sl: y(u) = 1, 
for! < u S 2: y(u) is monotone decreasing, 
foru = 2: y(u) = 0. 
To illustrate, we could take y = e“/"-”/=-2) in the range 1 <u < 2. 
Suppose 2, 22, °°: NM, are the coordinates of HE” and £,, &,---, & are the 


corresponding coordinates of the Fourier transform space. We define the trans- 
form Ke of the kernel as 


Ke = ¥(&/8), where 
b= (@+84+---&)h 
Thus Ko is a spherically symmetric non-negative C” function which is 1 inside 
the sphere £ = 6, zero outside the sphere £ = 26, and smoothly decreasing with 
£ in the annular region between the two spheres. 

Ko is the transform of Ke ; so Ke is spherically symmetric; it is real because Ko 
is even; it is analytic because Ky vanishes, except for & < 26; and | Ko| de- 
creases as rapidly as any negative power of the distance because all derivatives 
of Ky are continuous. 

As @ varies K» will be more or less concentrated at the origin. But the integral 
over all of KE” of Ke will always be the same. Since variation of 6 changes Ko 
only in a way corresponding to a change of scale in the transform space, Ke will 
change in a similar way, with normalization being preserved. Specifically, we 
can relate Ks to Ki (where 0 = 1) by the equation 


(A2) Ko(x1 , 22, °°* tn) = 0° Ky(601, 02, +--+ OXn). 


I 


(Al) 


Effect of convolution on derivatives 


Convolution and differentiation commute under favorable conditions: 


: (Ke *f) “(2 


Ox; OX; 


(A3) Ks) * f. 
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In our applications of this f will vanish outside a compact region. Since Ko 
(and also its derivatives, as we see below) decreases rapidly away from the 
origin, the favorable conditions will be more than met. 

To consider a derivative of K» , observe that 0K»/dx;, which we abbreviate 
to Ke ; satisfies 


Thus the transform of Ke,; is a C’ function and vanishes outside the sphere 
£ = 20. So | Ke.;| decreases rapidly in the same manner as | Ky |, and the same 


is true for the higher derivatives of Ke . 
From (A2) we see that 


Kolar, +++ 5 tn) = 0° Ry, (001, +++ , Orn), 
so (A8) can be written as 
(Kesf)i = 0 Ky (0x;, ++ , Otn) * f 
= of ——<~ | Kus, _ , OY n) fla 2 YW; iS e y Un cana Yn) dy, mane AY» 


ll 


a | = [ Kia, an »2n) f(a — 2/0), ee Oy 2,/0)) dz, +++ dz,,. 


Here we considered the integral formula for convolution, letting the kernel carry 
the dummy variables yi, y2, °°: , Yn. Then we made a change of variables: 
dy; = 2;. Now using the last equation we can say 


(Ke «foal < a(max|f|) f --- f | Krs| der +++ de, 
C 6 max | f |, 


IA 


IIA 


where C is the integral of the absolute value of K,,;. Similarly we could obtain a 
bound for the size of a higher derivative of Ks * f. Each such bound would involve 
a constant, analogous to C’, and @ to the power equal to the order of the deriva- 
tive. A higher order derivative can be considered a derivative of a lower order 
derivative and we can bound it in terms of the maximum size of that derivative, 
before convolution with Ke . To illustrate, 


(Ko *f),j: = (Ko *fis)e = (Kos * fa), 
| (Ko * f) 93 | <= C @ max lh | . 
If we are not concerned with the precise sizes of the constants, such as C’, which 
appear in these estimates we can put them all in one comprehensive statement. 
Let max“ f stand for the maximum of the values attained by the absolute values 


of all derivatives of f of order s at all points of EZ”. 
Then we can say generally: 


(A4) max” (Ky « f) < C,.6"° max” f, when r = s. 
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24 JOHN NASH 
Here C,, is to be a constant, independent of f, and in fact depending only on 
r— 8. 


Effect of varying 6 


We need to know how rapidly K» « f and its derivatives change as 6 varies. 
Of course we can say 


) _ 0Ke 
- ap he ed) = ay ed, 
since we do not think of f as varying with 0. 0K»/06@ will also be a kernel with good 


properties, as we see below. 
We can begin by considering 


0 = 0s _ 6) ae 2a — gol 
(A5) ag fe = ag = ap [W(E/0)] = —E/0 W'(E/0) = O x(E/8). 
Here we introduce a new function, x, defined by x(u) = —uy’(u). Observe that 


x(u) = Oforu < loru S 2, and that x(u) > Ofor1l < u < 2. Alsox is C”, 
so we see that 0K.»/06 has the same general properties of analyticity, smallness 
at infinity, etc., that Ke has. 

Let Z stand for the value of 0K,/06 at 6 = 1. We can express 0K,/06 in terms 
of Z in a manner analogous to the way we obtained in (A2) an expression for 
Ke as 0"K, (6x1 , --- ). The only difference in the situations is the appearance of 
@* in (A5). So we obtain | 


(A6) < Ke = 9" L(x, , 0x2, --+ , O2n). 


We want to express (a, %2, --: , Xn) as the sum of n special functions Z; . 
We do this via the transform of L, which is x(£). Thus we shall have 


(A7) L =x) = Dake. 
To define the L; we construct a non-negative C~ function a; for each transform 
variable &€; : 

a, =0, for |&|S (2n), 
and 

a, = el @n FNRI 

for | | > (Qn). ; 
Observe that when é > (2)? thenéi + & + --- + £7, > 3 andsome £; > (2n)+ 


so that some a; > 0. Therefore a, + a2 + ++: + a, is positive and C” in the 
region 1 < & S 2 where x(§) is non-vanishing. So we can define 


[Sa 
I 


a; 
a for €21, and 


L;=0, for & <1. 
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These functions will be C” everywhere and will satisfy (A7). Each ZL; has the 
important properties: 


(a) L; =0 for &€=2 


The corresponding kernels L; will clearly have all the good behavior properties 
at infinity, etc., that K, has. And we have 


the value at @ = | 


Dak = L= 


of 5 o Ke 


Purpose of the L; kernels 


Each L; is so constructed that when one forms the indefinite integral i L, dz; 


the result is still a function that is small at infinity. We see this again via the 
transform L,;. Let H; stand for the rth member of a series of kernels developed 
from L; and defined by 


= (&(— 1)')L,. 
The properties of L; that were emphasized above assure that Hj will be a C” 
function of £ , &, --- , £. which vanishes when | ¢;| < (2n)* or || = 2. There- 
fore the kernels H; are analytic functions which decrease rapidly away from the 
origin. 
The important property of the H; is that 

cm = L,;. Also Ht" = [ Ht dez, because 

[ Hi dx; = 0, since Hi; =0 when ¢; = 0. 


The H; help us to estimate the size of 0K,/0@ * f from data on the size of 
derivatives of f. By (A6) we have 


) ae 
39 fe xf = {0 1 Do: L(Oxr, +++ Oxp)} 4 fi 25 Ga). 
Changing variables, 6c; — x;, as we did when estimating (Ke * f),;, we obtain 


0K 4/00 * f OTS L(x , “74 ae) * f(x1/6, an , &n/6)| 
a” De {Li(ai, +++, tn) * f(a /0, - ++ , x,/6)} 


= a > {Hi » 5 Xn Fe [f (21/0 a) r4/@)} 
gq” > (Hila »7 77 y In) * gq" a(x rae 


gt Def He IG ca f(x /A, - 24/0}, 


flas/0, «-- 4/0} 
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By the series of equations above we have expressed (0K¢/06) * f entirely in 
terms of rth order derivatives of f. These equations are, of course, invalid unless 
the rth order derivatives of f are continuous, which we assume. They lead to an 


inequality: 
2 a > of | i i 
< o ° eee . f a a ® f 
ag He “f < @ max ‘| dat ¥ max | H; | day aALn 


< 0,0" max” f. 
Since a derivative of (0K»/d0@) « f is the same as (0K,/00) convoluted with the 
corresponding derivative of f, we can also say 


(gx * f) < C, 0" max (f,,) 


< 0,07 max” f. 
Extending this principle, we obtain 
max” E Ko * f] < 0,607 max" f, or 
(A8) 06 


< C., 6° max" f. 
This is valid only for ¢ = s, so far as we have shown. 
Actually (A8) is also valid for ¢ < s, if appropriate constants C,, are used for 


the negative r values. We don’t need the L; to see this. Beginning with (A6), 
we obtain 


(3 x.) * f = ii L(62, , a g02,) * f(a, =" in) 


a” 6) r+n—1 or 
~ ° | 2K = ee eee 7 ie ee), 
at E 9 * f] 6 a(x,)* ( U1; ’ 6x ) * f(x ) 
Making the change of variables 6x; — x; , the right hand term is 
gr = L(y, +++ 5 2) *f(2s/0, ~-+ , 20/8), S C6" max |f |. 


Again we can generalize by replacing f by (0°/dz;)f. And we could deal with mixed 
partial derivatives. The general result would be 


max” | 4 Ko m f] < C_, | is max” f. 


This corresponds to (A8) with —r in place of r. So we see that (A8) holds with 
positive or negative r or with s S$ fors > 1. 


Smoothing on a manifold 


Let the manifold 9t be compact and analytic in the strong sense, so that it has 
an analytic imbedding ®t in a euclidean space #”. We can find a surrounding 
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neighborhood, Jt, in ZH” of R; and Jt can be such that for any point x in Yt there 
is a unique point y(z) on ® which is the point of R that is nearest to x. Also 
y(x) can be an analytic function throughout 9.’ 

Now let 

saa (= = x to =) 

where y is the C™ function defined before above (A1) and « is a small constant. 
If we assume ¢ is sufficiently small, g(x) will be a C” function throughout Jt and 
will vanish at all points near the boundary of Jt. Assume also that the definition 
of v(x) is extended by making it vanish identically outside Jt. Then g(x) is C™ 
everywhere. 

Now if f(y) is a function defined on ® we define an extension f(z) to EH” by 
putting 


f(z) = o(z) fy@)), for re, and 
fiz) = 0, for xe. 


This extends f(y) to a function f(z) which agrees with the original function on # 
and has the same degree of differentiability. 

The method of smoothing is simple. Beginning with f(y) on # one extends to 
f(z) in E”. Then Ko «* f(z) is the smoothed function. The final (a logical formality) 
step is to restrict the definition of K» * f(z) to Rt and again have a function de- 
fined only on &. But we must do more than simply present this definition of 
smoothing on #; we need to know how it affects derivatives of the function 
smoothed, etc. ‘T’o do this we need 


A standard size concept for derivatives 


9 has an analytic metric induced by the imbedding in E”. So at each point 
p of 8 we can set up an internal system of geodesic normal coordinates. This 
system is not unique, but is unique up to orthogonal transformations. At p we 
can measure the size of the derivatives of order r of a function by considering all 
the systems of geodesic normal coordinates at p. We define size”,’ f as the max- 
imum over all these systems of the maximum of the absolute values of the var- 
ious rth order derivatives of f with respect to the coordinates of that system. 
Then we call size’ f the maximum of size™,’f over all points p of 9. 

We need to know how the measure size” f(y) of the sizes of the derivatives of 
f as a function on & is related to the measure max” f(x) of the sizes of the de- 
rivatives of the function extended to E”. And when f(y) is obtained by restricting 
the range of definition of a function defined throughout EH” we need to know how 
the sizes of the internal derivatives will be related to the sizes of the derivatives 
with respect to coordinates of EZ”. In the first case it is fairly easy to see that there 
will be general inequalities of the form 


(AQ) - max f(x) < > ino By, size f(y) 


1 The existence of such a neighborhood ¥t is shown in Lemma 1 of [16]. 
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where the coefficients are constants determined by the imbedding: of #t in E” 
and the function ¢ which was used in the extension of the function f(y) defined 
only on & to the function f(x) defined on E”. 

Similarly, when a function g(x) defined on E” is specialized to a function, say 
g(y), defined only on ® there is a conversion from bounds on derivatives with 
respect to the E” coordinates (max) to the internal measure of the size of 
derivatives. This has the form 


(A10) size” g(y) S > vino Di max™ g(x). 
Actually, D> = 0, except for the trivial case r = 0, where D? = 1. The con- 
stants D, depend only on the imbedding §. 

Effect of smoothing on a manifold on derivatives 


We are now ready to see how smoothing of a function on a manifold acts on 
derivatives, etc., and relate this action to the size of the original function and its 
derivatives. Suppose f(y) was the original function. Then smoothing proceeds 
thus: 


(a) fly) > f(x) — by extension to E”, 
(b) f(x) > g(x) = Ke + f(x), 
(c) g(x) > gly) _ by restriction to Mt. 


We call g(y) = Sef(y) so that here Sp stands for the total operation of smoothing. 

Corresponding to the two general inequalities, (A4) and (A8), for smoothing in 
Ek”, we obtain two general inequalities for smoothing on #. For example, (AQ) 
gives us bounds on max‘”f(x) from size data on f(y). Then (A4) gives us max” 
data on g(x) from this data on f(z). Finally (A10) gives us size” data on g(y) 
from the max” g(x) data. The outcome is a bound on size” g(y) in terms of 
size f(y), size” f(y), ---, size f(y). If we use Sef for g(y) and f for f(y), 
and if we weaken the form of the bound by using the maximum constant in- 
volved, we get a bound of the form 


(A11) size’[Sef] SS Hys6"° dsj size”, fordéZ2l,rZs. 

Exactly analogously we obtain from (A8): 

(A12) size | © Sef | S Jn 6° Dito size” f, for @ = 1. 
06 


r-—s 


We use the restriction 6 2 1 so that we can majorize lower powers of @ by 6 
or 6” *’ in the two inequalities. The H,, and J,, coefficients depend only on the 
imbedding ® and ¢. 


Smoothing of tensors 


Here the first step is to express each tensor in a normalized (non-tensorial) 
form in terms of a set of scalar functions defined over all of . This corresponds 


This content downloaded from 
76.147.6.157 on Fri, 05 Jan 2024 01:35:50 +00:00 
All use subject to https://about.jstor.org/terms 


IMBEDDING OF RIEMANNIAN MANIFOLDS 29 


to using a specific redundant coordinate system on §t in order to have a co- 
ordinate system without singularities. Let 2’, 2°, --- , 2” be the coordinates of 
E” and let wu’, u’, --- , uw” be any local system of coordinates in §, such as one of 
the family of geodesic normal coordinate systems used in defining the size” 
concept. 

The imbedding defines a transformation 


(A13) (wi, weer, Ww) > GY, 2+, 2”) 


and the nearest point function y(x) defines a transformation x — y(x) in the 
neighborhood Yt, which gives a transformation of coordinates 


(A14) (x, cna | x”) = (u’, uw, Pe 6 u). 


Both transformations are analytic and their composition in either order is the 
identity on RR. 

Now suppose 7'75::1 is a tensor on R referred to the coordinates 
u,u,---,u’. Then we define 


Bie 8" oe Bub dx* dz! 
with the summation convention operating, and with dz°/du*, etc., taken from 
(A14). This definition has the correct invariance properties, so 3j7::: is defined 
globally on ® and is completely independent of the coordinates u', u’,--- , wu’ 
through which it is obtained. 
Because the composition of (A13) and (A14) is the identity, it is easy to see 
that the reverse conversion 
” j. OU" OUP ax” dx’ 
(sae = 4 fe eee Pegg Oh On a! Cea 
u aa Ox’ Ox! du du® 


yields the original tensor again from the normalized form Tit... . 

The smoothing operation for a tensor consists of three steps: (a) conversion to 
normalized form, (b) smoothing of each component of the normalized form by 
Se, and (c) converting the result back to a tensor on ®t via the reverse con 
version. | 


Derivative size concept for tensors 


If 7 is a tensor we consider the standard local coordinate systems defined 
before and at the center of each system we consider all the derivatives of order r 
of each component of 7’. The maximum of the absolute values of these is then the 
local size of T. Then the maximum of this over all the standard systems is 
size” T. 

Associated with the conversion of tensors to normalized form and with the 
reverse conversion there will be conversions from size” 7’ measures to size” 
measures on the components of &. These will give us inequalities quite anal- 
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ogous to (A9) and (A10). By combining these inequalities (we shall not bother 
to write them out) with (All) and (A12) to estimate the effect of S» on the 
components of £, we can obtain the analogous bounds for smoothing of tensors 
via the process T — &T — Se — ST’ (definition of ST). These bounds are 


(A15) size (SeT) < L,6"* De size? T 
forr2s3s,9¢21 and 
(A16) | size” (4 s,)r | < M,, 0° Yj size” T for 6 = 1. 


Note that the process for smoothing tensors preserves certain simple properties 
that a tensor may have, such as symmetry or skew-symmetry, etc. 


Concluding remarks 


The size’ concept is suited to routine estimations concerning sums, dif- 
ferences, or products of functions or tensors. For example, 


size (fg) S (size f)(sizeg) + (size) (size 9), 
size (TiS) < v size (T?) size (S?). 


In the second estimate we invoke the summation convention and vy should be the 
dimensionality of 9t. These remarks are made to prepare for the frequent use of 
such elementary estimations in Part B, where we shall not specifically mention 
these properties of size in the instances where they are used. 

Another general bound referring to the action of Ss is derivable from (A15) 
and (A16). This is 


(A17) size (T — ST) < N,.6"° Diiosize’T fors = 7r,6 = 1. 


In the case s = r this is derived trivially from (A15). When s > r we use 


70 
T—&,T = [, (3.s.) 1 ao 
and apply (A16). 


Our more or less elaborate development of Ss was undertaken to give us a 
smoothing operator for which (A15), (A16), and (A17) would hold. Without 
care in the definition one would obtain a weaker set of bounds. (A17) would 
probably have 6™*°~*™ instead of 6°. (A15) would probably not be weakened. 

There is a good heuristic interpretation for an operator such as Sz. It is a low- 
pass filter which passes undiminished all frequencies below @. Above 26 it cuts 
off completely. Between @ and 26 there is a variable attenuation, decreasing with 
increasing frequency so that the characteristic of the filter for all frequencies is 
a C” function of frequency. 
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Part B: THE METRIC PERTURBATION THEOREM 


The perturbation device 


The perturbation process developed in this part of the paper is based on a 
method for finding an infinitesimal change in the imbedding of a manifold that 
will effect a specified infinitesimal change in the metric induced by that imbed- 
ding. The smoothing operator S» of Part A is used in connection with this method, 
the ‘‘perturbation device’’. 

Let St be a compact n-manifold smoothly imbedded in £”. Let the Gries 


coordinates of E™” be 2, 22, --- , 2m. Referred to a set x, %2,°-: , Xn, of local 
coordinates in Yt, the metric tensor induced by the imbedding is 
OZa OZ 
Bl Wo a puis a P 
a ga = 2 ax; Ox; 


We can consider perturbations of the imbedding as rates of change, measured 
with respect to the change of a parameter. The parameter is unspecified and we 
indicate the rate of change of any quantity by placing a dot over it. Thus we 
have 


OZ 02a OZ OZ 
(B2) oe Ox; Ox; + Que Ox; Ox;’ 


which follows from (B1). 

We want a method by which we can determine {2,} satisfying (B2) when 
{gi;} 1s specified. We can make this problem simpler to solve by adding another 
condition to be satisfied by the rate {2.} of change of the imbedding. This is 


(B3) >. we a) for all ¢, 


which requires the perturbation {z,} to be normal to the imbedding. 
Consider the result of differentiating (B3) with respect to 7; : 


. 2 
(B4) pp a = — doa e me dy 


OX; OX; 


The left member occurs in (B2) and the right member is symmetric in 72 and 7 
(since the imbedding is to be reasonably smooth). Consequently we can use 
(B4) to modify (B2) and obtain 


: a” oy « 
(BB) jee a2 es, 


This is the condition the perturbation {z,} should satisfy to effect the metric 
perturbation {g;;} when (B3) holds. 

Now we have a much simpler type of requirements relating {2.} to {g;;}. 
Together (B3) and (B5) form a system of linear equations to be satisfied by the 
Za, Whereas before we had partial differential equations in the 2. . 
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How and when can we solve (B3)-(B5) for {2.2} after {g:;} has been specified? 
The number of variables 2, , which is m, should be at least as large as the number 
of linear equations, which is 4n” + 14n, taking the ¢, 7 symmetry into account. 
Probably m will have to be larger than in? + 13 to insure that the equations 
are not singular at some points of J. So the equations will probably be under- 
determined. 

In Part C we construct an imbedding of Jt such that (B3)-(B5) is everywhere 
non-singular. Here in Part B we assume that the imbedding has this property and 
simply make our results conditional upon this. 

We must find a way to select a particular solution {z.}, and do this in a smooth 
way, when the equations are underdetermined. A very simple requirement, 


doa (2a) = minimum, 


(B6) 
subject to satisfaction of (B3—B5), 


selects a particular solution in a satisfactory manner. {2,} will have the same 
degree of differentiability as {g;;}. 

The geometrical interpretation of the effect of (B6) is that it selects the nearest 
point to the origin in the plane of solutions {2,} of (B3)-(B5). We can also study 
the effect of (B6) from a more formal viewpoint. The system (B3)-(B5) is of 
the form: 


(a) cal eee — Pu- 


If we assume a solution in the form 


(b) go> Oe 
then the d,’s must satisfy 

(c) Dia Cured: = Op: 
Let 

(d) Ei = Da Crain 
then (c) becomes 

(e) Les = On 


This last equation (e), will be non-singular if det || E,,, || is not zero. However this 
is Gram’s determinant for the matrix || C, || and it cannot vanish unless || Cy || 
has less than maximal rank (3n’ + 12n). Because we are assuming that (a), 
which is (B3)-(B5) in a condensed notation, is non-singular, we know that 


rank || Cue || = 4n’ + 14n. So (e) is non-singular. Because (e) is not under- 
determined it has a solution in the form 
(f) d, = | Eu | -_ {eu}. 


Now from (b) we can express a special solution {22} of (a) (or of (B3)-(B5), 
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which is the same thing) in the form 
(g) ga = || Cow ||| Zw |] + {eu} - 


This special solution of (a) happens to be the one for which >). (2.)” is min- 
imized. Suppose {2,} is any other solution of (a). Then 


(h) Ds Cue (24 a 22) = 
We can write 
(i) You a) — Yow BP = Yon ba — BY +2> 50 88(22 — 25), 


and the last term vanishes because 
2) aecey 20) S 2D al eel Ge 
= 20), d, da Cre (da — 22) 
2>- d,-0 


= 0. 


(j) 


Here we employed the expression (b) for 22 . 

Now that we know the last term of (i) is zero it follows that the right hand side 
is positive and > \4 (2a) > >. (2). Thus the special solution of (B3)-(B5) that 
is given by (g) is the same as the one selected by (B6). This shows that (B6) 
determines a solution which is a well behaved function of {g;;} and the deriva- 
tives of the imbedding functions, so long as (B3)—(B5) remains a non-singular 
system. 

The solution of (B3)-(B5) determined by (B6), or equivalently by (g), has 
the following form of linear dependence on {g,;;} and analytic dependence on the 
imbedding derivatives: 


(B7) fa = Dies do Pow (55) a5: a 
= isi Jij f aij 0x,|) ? | Ox_ 021) / © 


{gi;} is represented in (g) by {y,}, so this shows that {z.} depends linearly 

n {g:;}. The F.;; are analytic functions of the first and second order derivatives 

of the imbedding functions, so long as these are such that (B3)—(B5) is non- 
singular. 

To recapitulate, (B7) iicieaies the form and behavior of the solution of the 
system: 


OZa. 
Oza 3 = Gs 
(B8b) =2 Des Ox, ja. = 9ij 
(B8c) doa (24) = minimum, subject to (a) and (b). 


The solution is an imbedding perturbation rate {2.} which leads to the rate 
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{9:3} of change of the metric induced by the imbedding. We call this method of 
determining a perturbation rate {2,} the “perturbation device’’. 


Notational conventions 


The work below becomes almost entirely the treatment of a problem in 
analysis, so a different (condensed) notation is appropriate. We shall drop the 
coordinate indices generally. Thus 


2 


{Za} becomes 2, {2.2} becomes 2, 0% becomes 2’, oe 
Ox; Ox; OX 


nie becomes 2”, etc. 
We now write (B7) as 
(B9) 2= F(z’, 2”) lg. 


XX indicates the (contracting) tensor product acting between F and g. (B9) is 
the solution of 


(a) zZozg =0 
(B10) (b) —Q2”" 02 = 9 
(c) |2| = minimum, subject to (a) and (b). 


Here o indicates the scalar product, summation over the index a of E”. We also 
have 


(B11) G = 22 @2 


as a modified condensation of (B2). Here @ is a symmetrizing tensor product 
accompanied by a summation (as with o). 

We shall deal with many inequalities on the sizes of functions and derivatives: 
Generally these will be tied to the parameter 6, which controls the smoothing 
operator (see Part A). @ will play a dual role, being both the parameter of smooth- 
ing and the parameter of the process. Dot will denote 0/006. [e.g.: 2 = 02/06]. 
The process will begin with a specific value of 6, called 6) , and end with @ = o. 

Our canonical notation for bounds on the sizes of functions and their derivatives 


is explained by illustration: 
T < Kl6|~2: 4] 


oO 


indicates a whole system of bounds on the tensor 7 and its derivatives measured 
in terms of the size™ concept of Part A (think of 7 as varying with 6), which are 


size? T < Ko” 
size? T < Ko" 
size” T < Ko” 
size? T < K 
size” T < Ke 
size” T < Ke’. 
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In general, if the symbol is [6 | 7’ 3], where r and s are integers and0O Sr S&S s, 
p < q, the exponent of 6 is p for size”, size, --- , size. Then from size” to 
size” the exponent increases in arithmetic progression in such a way that we 
have 6 for size. Almost always this increase will be one unit for each increase of 
the order of differentiation, as in the illustration. In other words, we usually 
have g — p = s — r, and also gq and p are usually integers. 

Most often we can abbreviate [6| 7?’ ¢] to [?' 3] and understand that 6 is 


involved. Also we shorten symbols such as [3° 3] to [3]. 


The perturbation process 


As was remarked above, this process uses the perturbation device and the 
smoothing operator. It also uses ‘‘feed-back”’. 
The equations defining the process are listed below: 


(B12) ¢ = Sez 
(B13) 2 = F(’’,¢’) RM 
E=M-—-gqg 
(B14) 
=2¢- Je? 


(equivalence shown below) 


(B15) u(p) = a special C” function, nondecreasing everywhere and monotone 
increasing for 0 < p S 1. Also, u(p) = O for p S 0 and u(p) = 1 for 
p = 1. To be specific think of u(p) = ¥(2 — 7p), where y is the special 
function of Part A. 


(B16) L(6) = [ E(6)u(o — 6) dé 


G = the desired total change of the metric tensor (a symmetric co- 
variant tensor). 


(B17) 


(a) [ M(6) dé = ul6 — %)SeG + Se L(6), or 


(B18) (b) M = ul — &)SeoG + uld — O)SeG + So L + SoL, or 
(c) M = u6 — &)SeG + ud — %)SeG+ Se L 


6 
+ i E@)u(o — 8) db. 


Interpretation of the quantities and equations 


Each of the quantities has its interpretation, but our interpretation may seem 
unilluminating to many. ¢ is a smooth approximation to the imbedding. M is the 
rate of metric change that is being “attempted.” Since the actual rate, z, of 
perturbation of the imbedding is computed from a formula which is like (B9) 
but has ¢ in place of z, we cannot expect that the actual rate, g, of metric change 
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will be the same as M. If the imbedding function were ¢ instead of z then g 
would be M. That is, (B13) gives the correct rate of imbedding perturbation to 
accomplish the metric change M, provided ¢ is the imbedding. This implies that 


(B19) M = 2’ ® 2. 


E is the error rate, or the excess of the attempted metric change rate over the 
actual rate, so H = M — g. Since we have (B11), 


g = 22’ @ 2, 
we can subtract this from (B19) to obtain the alternate # formula: 
E = 2(¢ — 2)’ @ 2’. 


This formula is the more useful one for estimating the size of E. 

L represents accumulated error. It is not the total accumulated error, but it 
includes all the error incurred up to @ — 1. Between @ — 1 and @ it includes only 
part of the error. Effectively, there is a lag in the inclusion of error in L. This 
effect is accomplished by the weighting function u(@ — 6) in (B16). It is not 
really necessary to define Z so elaborately. It could be the total error. The 
advantage of the more elaborate definition is that it ultimately makes it easier 
to view the process as controlled by a set of very tame integral equations. L 
has the simple integral expression 


i I E(8)u(o — 8) dd. 


If L were the total error 1 would be #. Then it would seem that M was defined 
from # and # from M. We avoid this complication with the definition we use. 

The definition of M is based on the principle of “feeding in” the smoother 
parts of the desired metric correction first, saving the rougher parts for later. 


6 
Referring to (B18a), we can consider / M the total ‘‘attempted’”’ metric change 
TY 


from the beginning of the process at 4 to the current situation at 9. This is set 
equal to Spl + w(@ — 6)SeG. So it is the smooth part of L plus the smooth 
part of G weighted by the coefficient u(@ — 6). The reason for attaching u(@ — 6) 
to SeG in this formula is the simple one that for 9 = 4 both sides of the equation 
(B18a) should be zero. At the beginning of the process a finite portion of G, 
specifically S»,G, is considered smooth enough to be ‘fed in.’’ But it must be 
fed in gradually, so we use u(@ — 6) to make the process start gradually. For 
6 => 6 + 1 this factor u(@ — 6) is just +1 and is irrelevant. 

We can see how these definitions should work out if the process is convergent. 
The total metric change accomplished by the process from its start at 4 to the 


limit as 6 — © will be | g dé. From (B14), 
69 


(B20) g= M — #, therefore 
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(B21) [ a= | w- E 
09 Th) 69 


I 


iOS. G8. Lay = [x 


G+ L(o) — L(o) 
= G. 


To put L() for [ & requires the assumption that L — 0 as 6 — o. (B21) 
09 


verifies the general design of our ‘“‘feed-back”’ process, but of course the main task 
is the proof of convergence. These remarks on interpretation should not be 
regarded as if presented as proofs. However, we shall use the equivalence of 
the two formulas of (B14). 

To prove that the process works we first derive a set of appropriate a priort 
bounds on the quantities involved which would be satisfied if the equations 
defining the process have a solution up to some value 6, of 6. Second we prove 
a local continuation theorem about solutions of the equation system. The com- 
bination of the bounds and the local continuation theorem gives us the existence 
and uniqueness of the solution for all values of 6, but we assume that G is suffi- 
ciently small and that 6) is properly chosen in obtaining this result. 


The estimates 


These estimates, or bounds, form a self-interacting system because the size 
of each function tends to depend on the sizes of the others. We assume the 
system (B12) through (B18) has a solution for 6) S @ S 6; and assume that the 
quantities #, M, ¢, z, L, etc., satisfy certain bounds in this range. Then we 
compute new bounds on the quantities from the defining equations. Finally we 
show that when G is sufficiently small and 6) properly chosen there exists a set 
of bounds which is satisfied by the initial values of the quantities and is such 
that the rederived bounds computed from the defining equations are all smaller 
than this original set of bounds. 

The first bound is 


(B22) c—a< ef. 


zo stands for 2(60), the initial value of the imbedding function. We assume that 29 
is analytic and such that (B8a, b) is non-singular. F(z’, 2”) will be well behaved 
when z, 2’, 2” are near 2 , 20 , 20 . So (B22) is designed to insure the good behavior 
of F(¢’, ¢”). © must be a sufficiently small constant so that F(¢’, ¢”) will be well 
behaved when (B22) holds. 

There will be some value 6, (assume 6, = 1) such that for 6 = 6, we have 
Sezo — 20 < €/2[2]. So we make a requirement: 


(B23) 65 2 Oa. 
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With this requirement we can satisfy (B22) by keeping z — 2 small enough. 
Let 

(B24) Zo <, afaial. 


This is for notational convenience only. 29 1s of course independent of @. The 
bound a should be chosen so that (B24) holds for all 6 = 1. For z — 2 we as- 
sume a bound 
(B25) 2— Zz X Bll. 
Note that if (B23) holds and 6 is small enough then (B22) holds. Adding (B24) 
and (B25), 

2S (at Ba 
(B26) as 
XS asia). 


£ is employed for notational advantages. 
Other bounds used are: 


(B27) L < Nl 
(B28) M <ul ors 
(B29) 2 <r ota) 
(B30) E < ol 
(B31) G < 43]. 


The bound on G isa handle by which we can refer to the size of G, which we shall 
assume to be as small as necessary to make the process converge. This smallness 
includes derivatives up to the third order. 


The rederived bounds 


Now we assume that (B22) through (B31) hold for a solution of the equations 
of the process for 0 < @ S 6 and use the defining equations to compute new 
‘“rederived” bounds on the same quantities in terms of the original bounds. The 
new bounds hold in the same @ range and are distinguished by starred Greek 
letters. 


First consider L. From (B16) 
6 — —_ — 
1(0) < [ nl0|~$-3u(o — &) aa 
J 06 


When k 2 2 
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Since |u| S 1 and 6 = 1 we can put 
L < nl00| 073] 
(B32) S 160" [3] 
< \*E3). 


To estimate M we use (B18c) and have 
M < w(0 — 6)Cr6[3'4] + Cod[-0'4 

(B33) _ a 
boa ret + sf [3 | “$31 w'(o — 8) a. 


The first term is obtained from (A15) applied to (B31). The constant C; is simply 
the largest of the coefficients that would come in from (A15), In general, when 
we have unspecified constants appearing in estimates we shall simply number 
them C,, C2, -:: in order of occurrence. We shall not be concerned with the 
actual sizes of these constants. The second term comes from applying (A16) 
to (B31), remembering that | u| S 1, and the third comes from (A16) and (B32). 
For the first term we say 
(8 — O)Cr8[3'4] < [6°u(8 — 60) ICS ora] | 
SX maxeygogoots {O°} maxe {u(8 — 6)}Crdl ol 
< C3(00 + 1)*o oul. 


This brings the first term into the [0's] form, leaving only the fourth term of 
(B33) to be treated. 

We want the fourth term of (B33) majorized by a [0's] term. Let 6* = 
max (6), 4 — 1), then since «’(6 — 6) = 0 when 6 S 6 — 1, we can say 


6 
the fourth term < So i n{6| 0:3] u (@ — 6) dé 
& 


SSo{n max » [u(p)] [6* | ~3°73]}, 


because the interval of integration is not more than one unit long, 1.e., 6 — 6* < 1. 
Continuing, 


the fourth term < Can[6* | ~0" 4], 


where C, is to take care of max, w(p) and the constant coefficients arising from 
(A15) (which tells us how So acts). Then, since 6) S 6* S 6, we can say 


the fourth term < Csn00"(6*/0) [8 | ~0'4l 
XS Can6o" (4) 18 | “or 
(because 1 S 6* < 6, and 6* = 6 — 1) 
< CsnO0'{-0' 4]. 
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Combining these results we obtain 
M X {C3(@0 + 1)*6 + Co6 + Cor* + C500 nt { ora] 
< u* Tor. 


~N 0,4 


(B34) 


To estimate z we need (B34) and an estimate on ¢. For ¢ we write 
(B35) t< Cele), 


by applying (A15) to (B26). The reason for extending this estimate to sixth 
derivatives is that z depends on ¢”, so that the fourth derivatives of z depend 
on the sixth derivatives of ¢. 

A derivative of z will depend on derivatives of various orders of M and ¢. 
For example, we can write symbolically 


= FO, 0) WM! + Pe)" BM + (Py) 0" BM. 


The function F and its partial derivatives Fy, and F;-, will be bounded when 
(B22) holds so we can say 


size 2’ < const. u*6@* + const. Csfu*O ~~ + const. Ceéu*O 


by applying (B34) and (B35). Notice that the highest (least negative) power 
of 6 comes from the term where J is differentiated. For a higher derivative of z 
there would be many terms and the highest power of 6 would appear in the term 
with the maximum differentiation of M. If we majorize the lower powers of 6 by 
the highest and observe that u* would appear once in each term we can put the 
general estimate for z in this form: 


2 < Pi(é)e*{o'4), or 
<y* 0% 


rw 0,4], 


(B36) 


where P,(£) is simply some fourth degree polynomial in é, the first of a series of 
such polynomials that we shall use, analogous to the series of numbered con- 
stants. Note that z has the same 6 dependence as M. 

We can use (B36) and an estimate 


¢ — é = Cré{ 04], 


obtained via (A17) and (B26), to estimate #. Using FE = 2(¢ — z)’ ® 2’ we ob- 
tain 


E <2{CieC oral}! @ fy*T oul’, or 
E < 2C:é o's] @ y*[_ o's], or 
(B37) — 
EB ns Cs&y* | 0, 3 
<n*fo' 3 


This illustrates the pattern of the bound corresponding to the product of two 
bounds expressed in our notation. 
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The estimate of z is the most laborious. The simple estimate obtainable 
6 
through z2 = z + | 2 is not good enough for the third derivatives of z. We 
99 
need an estimate on integrals of M as an intermediate step. Here also the direct 
estimation is inadequate, but we can use (B18a) to say 


63 
| M(6) dd = u(63 — 6)Se, G+ So, L(6s) — u(O2 — O%) Se, G — So, L(62). 


Applying (A15) to the G and Z bounds, assuming 62 S 6;, and using |u| < 1, 
we can say 


63 
[MO a0 < Oxo + d*) [65 | 8:4. 
2 
The straightforward integration of the M estimate, (B34), yields 
93 63 
[ mows] wttiae 
8. Bo 
X< Cr u* [62 “ok 
This estimate is stated for differentiation only up to the second order because 


for the third order i 6* d@ would lead to a logarithmic term; and for the fourth 


order 63" would majorize 6;° rather than the 6 term majorizing the 63; term 
(which occurs up through the second order derivatives). 

Now we have two estimates, one good for lower derivatives, and one good 
for higher ones. If we add the two estimates we can safely extend the range of 
the second and modify the first in the range covered by the second. This gives us 


(B38) i "M (6) do <Cu(u* + 6 + A*) {[62 | W014) + [65 | oral}. 


We also need an I’ estimate in obtaining our z estimate. Symbolically written, 
Fo= Fol! + Fyek’. 


To use this we must estimate ¢, which is (Ssz) = Sez + Sez. So from (A16) 
and (B26) and from (A15) and (B29) we derive 


& XS Cre{ o's] + Crsyl ore] 
<Culé + y)[oel - 


Using this ¢ estimate we can now estimate F’ in a manner exactly analogous to 
the manner in which we estimated z = F & M. The result has the form 


F< P(E + vod + PalOE + 83 
(B39) ~~ 
< P(E + 23). 
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Finally, we need an estimate on F itself. Here the highest power of @ will 
come from the maximum differentiation of ¢”. So the estimate takes the form 
(B40) F< Pes) (0 
and has the same @ dependence as ¢”. 


The z estimate 


We actually estimate z — zo, then z is easily estimated from this result. First, 


G3 
2(03) — 2 = | 3 do 
0 


63 x r) 93 ) 
= — F a M dé } dé. 
80 (3 8 


Now we apply integration by parts and have 


03 09=63 Os O3 
20) -a=-|F af M do + Pa(y M do) di. 
92 G2 


bo=6¢ 69 
The first term can be evaluated and we obtain 
z(03) — % 


(B41) = F(§"(), ¢" (6) @ | * Mt a8 + | {Pet 6.) & | ” Mt aeh dB... 


At this point we insert the estimates (B40) for F, (B39) for F, and (B38) for 
| 1 a0. This yields 
(B42) z— 2% 

S Ps(¥)[O0 | 114] BI Cu(u* + 8 + A*){[G0 | ~o4] + [65 | O41} 


+f {P(E (E + A) [Oe | 72:71} BW Cu(u*® + 6 + A*){[O2 | ~o'a] + [6s | ~o4l} doe. 


Call the two terms on the right 7, and J.. Using 63 2 6) 2 1 we can weaken 
and simplify 7; and have 


Ti X PolE)(u* + 5 + A*)[45 | 374]. 
T. must be handled with more care. Each derivative of 7. would correspond 


to the sum of several integrals involving various powers of 62 and 63 in the 
integrand. For the rt order derivatives of 7’, there are terms of the form 


63 
i constant - 6: - 03 °° d6e, 


99 
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where r = 0, 1, 2, 3, 4 and either s = r — 5 (which makes r — s — 5 = OQ) ors 
satisfies —2 < s < r — 2. These two alternatives correspond to the two expres- 
sions [62 | sy and [6;| 04] which are added in 7,. These integrals give 
varied terms and we can handle the situation most clearly by simply listing all 
cases. (This is done in Figure 1.) 


r= 0 r= 1 p= 2 = 5 r=4 
ser-—§ | 195° | 165° 195° | Oo log (63/60) 
s= -2 | 665° 65° 03" 60 63° 60" 60" 83 
“i 1 1oe(65/6)65° | 10@(65/)6s° | log(@x/8)6' | log(@:/6) 
0 | 03° 63" 1 
Lo 105 } 
a L a 
majorizer | 65° 65° 65° a 60 63 
Fra. 1 


By using the majorizing terms listed at the bottom of the chart we can say 
Ts X PoE + y)(u* + 8 + d*)60'[4s | 3:4 


Because the pattern of powers of 6) does not fit into our notation scheme we 
have simply used the highest power (4°) which occurs. 
Now if we add the 7; and 7’, estimates we get an estimate = _ — 2%: 


2(43) — 20 X Ps(€)1 + & + y)(u* + 6 + A*)[63 | 3; or 
x => 20 = B* [34]. 
Since 2 = 2 + (2 — Zo) we can say 
25 (at 8*)[5%4 
< ea). 
To conclude the rederivation of bounds we must consider the Requinoucns 
(B22). We have stipulated 6) 2 6, so that 
Sezo — Zo S &/2hp]. 
Applying (A15) to (B48) we see that 


¢ — Seo = Solz — Zo) S Crs8*(0'A). 


(B43) 


(B44) 


Adding inequalities, 
£ — 20 = (¢ — Sezo) + (Sezo — 2%) S (CisB* + e/2)[5] 


e*[3]; 


A 


(B45) 


A 


we hope, of course, that e* S e. 
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Strong consistency of the bounds 


We verify here that if 6 is sufficiently small we can choose 4 so that 6. 2 4. 
as required and so that all the rederived bounds are smaller than the originals: 
A* <A, u* <u, e* < e, etc. For clarity the formulas for all the rederived bounds 
are assembled here: 


A* = 700° (B32) 
u* = C3(0) + 1)°6 + C26 + Cor* + C5607 (B34) 
y* = Pilé)u* (B36) 
(B46) n* = Ceky* (B37) 
B* = Ps(E\(1 + & + y)(u* + 6 + d*) (B43) 
ge =a-+ ps (€ = a+ 8B) (B44) 
e* = ¢/2 + Cy6* (B45). 


Consider any set i, u, y, 7, 6, &, € (with — > a) of positive original bounds and 
consider the behavior of the rederived bounds as 6) —> © and as 6 — 0 for each 
6) value. That is, consider 


limg 0700 lims+o 


of each rederived bound. 
First we see that 


limo, {limssy A*] = 0, 


or \* — 0. Since A* — 0 we can see that u* — 0. Therefore y* — 0, whence 
n* — 0. Because p*, 6, \* — 0 we see that 8* — 0. Therefore &* — a and e* > &/2. 
These observations make it clear that, regardless of the sizes of the bounds 4, 
mb, ++ assumed originally, if is taken large enough and 6 is sufficiently small 
the rederived bounds \*, u*, --- will all be smaller than the original bounds. 


Existence of the solution 


To use the results above we need a general understanding of the equation 
system defining our perturbation process. We must know there will be a local 
continuation whenever we have a well-behaved solution of the system in an 
interval 6) < @ < 6,. Actually, the equation system is a very innocuous one. The 
smoothing makes the solutions analytic functions and makes them very well 
behaved, at least over small ranges of 0. 

The form which the system takes above, as (B12) through (B18), does not 
reveal its true character very directly. The extent of the taming effect of Sg 
is not fully apparent. Another formulation, given below, makes this tameness 
apparent. 

A removable aspect of the system as presented above is the differentiation 
with respect to spatial coordinates, which is indicated by priming, as with ¢’ 
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or 2’. This differentiation makes the system look like a partial differential equa- 
tion system, a type of system where local continuations of solutions often do not 
generally exist. 

If a quantity is defined by smoothing, such as ¢ = S,z, then its space derivative 
can be expressed as the result of an appropriate operator’s action on the original 
quantity. So let us say 


c’ = Sez. 


For higher derivatives we have analogous operators Ss , Se’, etc.; and for 
S» there is a similar series §9, So, ---. These derived operators also have 
smoothing properties. By using them we can recast the system as an innocuous 
functional-integral-equation system. 

Regard z, 2’, L and I as basic quantities. Then the others are expressible as 
functions or functionals of these four, either direct or indirect: 


C= Sz, f= Sz, " =Soz, "= Sez 
M = S,(u(@ — O)G + L) + Ss(u(@ — %)G + L) 
M’ = So(u(@ — 00)G + L) + So(u(d — %)G + L) 

z= FC’, 6”) BM 

Se (Bet + Bef") RM + F RM’ 

g = 22’ @ 2’ 

E=M-—g or E= 2(e' — 2) @ 2. 


m. 
I 


The basic quantities are to be equal to certain integrals: 
6 6 

at 4 daat| 
00 80 


[ u(o — dE) d@), L= [ u(o — dE do. 


z 


DL 


Now, since all the functions, functionals, and operations involved are well- 
behaved, in fact, analytic, we have a well-behaved integral equation system. 
Indeed, the system is especially tame. Since z and 2’ are defined through smoothed 
quantities exclusively and since z and zo are analytic, therefore z and 2’ are 
smooth and analytic (as functions of the space variables). Consequently g and # 
are smooth. So L and [ are also smooth. Thus all the quantities are kept smooth 
by the presence of S, in the equations. 


Continuation 


Suppose the system has a solution for 0. S 6 < 6,. We can show that this 
solution will satisfy the bounds \, un, etc. The bounds will certainly hold at the 
beginning of the solution, so they will hold in an interval % S 6 S ¢ where 
65 < ¢ < 6, if they do not hold up to 6. But now the smaller bounds 
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A*, w*, --- e* must hold in 6 < @ S gy. Therefore the larger bounds (from con- 
tinuity) must hold in some interval past ¢. This contradiction proves that the 
larger bounds, and therefore also the smaller rederived bounds, hold in the 
whole range 0 S 6 < 4, of the solution. 

Now suppose 96, is the limit of continuation of the solution. There will certainly 
be a solution in the closed interval 6 S @ S 6, because of the bounds and the 
tameness of the system. But a standard argument, such as the Picard method 
or the functional fixed point approach, will show that the system has a local 
continuation past 6, . Furthermore, the continuation of a solution will be unique. 
Therefore we have contradicted the hypothesis and we see that the solution 
will exist, will be unique, and will satisfy the bounds for all values @ = 6. 


Convergence to isometry 


We must show that the imbeddings z(6) occurring in the perturbation process 
tend to a limit imbedding which realizes the desired metric G + go, where go 
is the metric of the initial imbedding. 

The Cauchy criterion approach to the proof of convergence of the imbedding 
requires us to consider 


92 
z(@2) — 2() , which = i 2(@) d@, which 
1 


XS vl | 07 el. 


This estimate on 2(62) — z(@,) and its derivatives is good enough to show that 
the imbedding z2(@) and its first and second derivatives converge to a limit 
z(o). The integration of the z bound is too crude an approach for the third 
derivatives. 

To check the convergence of the metric, g(@), induced by the imbedding to 
the desired limit, go + G, observe that 


g(61) = go + [ g dé 
91 
(B48) = Jo + | (M — E) dg 


6, 64 
= 9 + M dé — E do. 
69 


TT) 


Now apply (B18a): 
6y 
g(A1) = go + ul — %)Se, G+ So, L(61) — | E dé. 
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Assume 6; = 6) + 1 so that w(@, — 6) = 1. Then we can say 


g(61) = go + SG + 8, (0) — f ‘(1 — u@ — o1E@) ao 


= | Ab, — 6) E(6) db. 


The first mtegrand vanishes for 6 S 6, — 1, and the second integral is L(61), so 


gi) = g + G+ Ge, — 1G + L@)] 


(B49) 1 
re I “luli = 8) — 1100) av, 


Let us call the last two terms remainders R; and R,. Then by (A17) and (B27) 
and (B31) we see that 


Ri < Ci(6 + d)[A1 | 0's). 


rw 


From (B30), 
Re X nlfi — 1 3" 8 : 


The bound on R» is good enough for third derivatives but the R; bound is not. 
So we see that the metric g(@) converges to the desired limit metric and also 
see that the first and second derivatives converge to the corresponding deriva- 
tives of the limit metric. But we do not yet know that the third derivatives 
converge. The limit metric, go + G, is C’, of course. 


More refined results 


This section of Part B can be passed over without losing the continuity of the 
paper. Here we show that the limit imbedding is actually always C* and we 
treat the cases where G is C*, C’, --- , C®. 

The cases where G is C* are treated by an inductive method. The C” case is 
handled with the aid of the results of the C” case. We can illustrate all the 
essential features of the induction (with minimum notational difficulty) by con- 
sidering the step from C’ to C*. To begin, we can assume bounds 


G <eé[| 


20 = a’ i 


(B50) 


that have the same form as (B24) and (B31), with 4 replacing 3. However, we 
make a point of not requiring 6’ to be small. 

Assume that the perturbation process is applied just as if G were only C”. 
We will have the bounds (B22) through (B31) on the quantities involved. Our 
result will be that we can deduce a new set of bounds that are analogous to these 
except for the systematic replacement of 3 by 4, 4 by 5, etc., in the indices re- 
ferring to order of differentiation. 
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In the above work which computed the set of rederived bounds A*, p*, etc., 
wherever an estimate was derived for a quantity defined by smoothing the 
estimate could have been derived for any particular upper limit on the order of 
differentiation. Instead of (B34) we could have derived 


M < a@o'sl. 


This is not the kind of estimate we ultimately want for M@. That would be an 
[o's] estimate. This is just a useful intermediate step. Similarly we can say 


on Cee[3"7] and 
2< Wot. 


By integrating this and using (B50) to estimate fifth derivatives and by using 
(B26) for the lower derivatives we get a crude z bound: 


From this we have 
f—2X< Ck os. 
Since # = 2(¢ — z)’ ® 2’, it can beestimated from our above results and we find 
E < wor 4l. 
A weaker # estimate, more conveniently manipulated, is 
E <0 “4. 
This weakened estimate behaves nicely in an integrand and we get new L 
and I, estimates via the LZ and formulas. These are 
L < XH) 
L SS Calo al. 
Now reconsider the estimation of M and use the new L and J, bounds and the 
new G bound, (B50). One gets 
M <x wt ordl. 


(B51) 


From this we obtain 
a<70d, then 
E < ff 3, - 
This sharper £ bound yields improved bounds for Z and L: 
LX Wal 
LX Gon a). 
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Now from these and the new G bounds we can derive 


M = Le dae 5]5 then 
zZ < - é ie 515 then 
E < nor al. 


The z estimate depends on estimating z — z. This can be done exactly like 
it was done before in computing the rederived bounds. We can use a weak ¢ 
estimate, improved only by extension to sixth derivatives. This gives us a weak 
F estimate, which is however adequate because it is used in combination with 


an improved estimate on | M dé. This improved estimate comes from the 
strong M, L, and G bounds aleve (with p’, d’, 6’). The result is of the form 
z2— 2 X Blais) 
=< lus), 


and this completes the set of new C* type estimates (indicated by primed greek 
letters). 


Our result concerning the inductive extension of appropriate bounds to the 
cases where G is C” also gives us a result for the C” case. The (typical) induction 
step from the C* case to the C* case shown above, involved the use of a new G 
bound (B50). But we did not assume 6’ was small. So our results on the C* cases 
with k > 8 are really sharper than the exactly analogous results would be, 
because only G and its derivatives up to the third order need be small. 

If G is C® the result of each C" case is valid, so the imbedding is C”. 


We can show that when G is C” the kt» derivatives of the imbedding converge 
so the limit imbedding is C*. Also the kt* derivatives of the metric converge. 
The argument is the same for any value of k, and since we have not shown this 
for k = 3, we treat that case. 

The basic fact is that the limits of the third derivatives of SsG as 9 — « are 
the third derivatives of G, and that this convergence is uniform. This follows 
from the uniform continuity of the third derivatives of G, a consequence of 
compactness. The fact can be symbolized by saying 


(B52) G — SG X< Adsl, 


where Ag is a constant for each 6 value and Ay — 0 as 6 — ~. Nothing can be 
said about the rapidity of the decrease of A¢ ; the fact is purely qualitative. 

The estimate of (B51) on L, although derived after we assumed G to be C", 
did not depend on this assumption. So we can use it here, where G is again only 
assumed to be C*. By applying (A17) to it one can obtain 


(1 — Spo,)L(8) < Crrd(6)*[61 | 70°]. 
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Assuming @, 2 6,, it follows that 
(B53) Se,L(6) — So,L(8) < 2Cyrd(6)*[6: | 074). 
We shall need a bound on L(@:) — £(6@,). Assume 62 = 6; = 6 + 1, then 


116.) — 146.) = f° ule, — (0) a0 — | wo, — EQ) a 


B2 
=i {u(0» — 6) — u(6, — 6)}E() do, 
i-1 

because u(@. — 6) = u(@, — 6) = 1 for 6 <= 6 — 1 and because u(6, — 6) = 0 
for 6 = 6,. Using (B30), 


269 
L(e) — L6) < fal ss-3) ae 
B54 ae 
ae Sal — 1S 


XS 16m: | 07s). 


Bo 
We must have a more refined estimate on | M. By (B18a), assuming 
6; 


6260 2 + 1, 


92 
M do = Se, G+ Se, L(6:) — Se, G — Se, L(G) 


61 


(B55) = So, G — So,G + So[L(2) — L(6)] + (Se, — So,)L(62) 


=-T+°r4+7T.. 


Our principal concern is the third derivatives, which our previous estimate, 
(B38), merely showed bounded, not decreasing. By (B52), 


Ta BS (Ao, ea As.) [3]. 


If 6, , 62 — © the third derivatives of 7, will approach zero, which is what we 
want. | 

(B54) is strong enough to show that the third derivatives of 7; will approach 
zero a8 6, 62 — ~. But 7, requires a more elaborate treatment. We can write 


T. = (Se, — So,)L(62) = (Se. — Se,)[L(@) + {L(42) — L(6)}] 


where 6 should be regarded as smaller than 6,. Then by (A15) applied to (B54) 
and by (B53) we obtain 


T. <, Crsn[6 | 0°73] + 2Cwd(6)*[0, | 8°49]. 


Observe now that 7’. and its derivatives up to order 3 can be forced to be arbi- 
trarily small if one first chooses 6 large enough to make the first term above small 
and then chooses 6, enough larger than 6 so that the second term is small. Effec- 
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tively then, as 0; , 6. — © the term 7, and its derivatives up to the third order 
approach zero. 
Since 72”, Ts, and 7.” all approach zero as 6, , 6. —> ©, we have shown that 


iD) ‘tt 
| u do | —-O0O as 0,0.—> ©. 


8; 


For the lower derivatives of this intergal the simple estimate obtained by inte- 
grating (B28) is adequate. We can combine the two approaches in a single esti- 
mate: 


99 
(B56) , dx wl. | 0% 2] + pwe,[& | ~o’sl. 
1 


ue 1S to be a constant for each @ which approaches zero as 6 > ~. 

We show that the third derivatives of the imbedding 2(6) converge by a 
Cauchy criterion argument. The difference 2(@;) — 2(@,) can be expressed in a 
form exactly analogous to (B41), where 6; appears in place of 6 : 


20) — 20) = FOO), 6d) & | "M do (=1)) 


A [ {Flat 6.) & [- M io do. (=T»). 


The analogue of the estimate we obtained, below (B42), for 7; is not adequate 
for our needs here. But the analogous T. estimate 


To S Pol (E + y)(u*® + 6 + A*)OT [Os | 374 


is quite good because 61° becomes small as 0 , 6: —> ©. 
The estimate (B56) was obtained so that we could handle 7; . We use this 
with (B40) for F and obtain 


Ts Po(—)p[O1 | > al s& Pro(E)u[61 “O38 - Pu) uel | "5 3h, 
This shows that 71’ > 0 as 6,, 6: > © and that is what we need to show that 
[z(03) — 2(0;) |" —0 as 4 j Oo —> ow. 


So we have verified that the third derivatives of the imbedding converge uni- 
formly to the third derivatives of the limit imbedding (which therefore must 
exist and be continuous). 


To see that the third derivatives of the metric converge consider 


A> 85 
g(@.) — g(A) l g do = i (M — F) dé 
1 


76, 
89 


G9 
M dé — E dé 


64 64 


wld | | + po, [41 | ~o'3l + 7A | ever? 


LA 
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from (B56) and (B30). Since ys, > 0 as 6, > ~, this gives us the convergence 
of the metric and its derivatives up to the third order. 


Summary of results 


The main results of this part of the paper can be summarized in one theorem: 


THEOREM 1: 
HyYPpotTHESEs: 


(1) Mt 7s a compact manifold analytically imbedded in a euclidean space. 

(2) The system (B8a, b) of linear equations in the 2, is non-singular at all 
points of the rmbedding. 

(3) G ts a symmetric covariant tensor on IN representing the change we want to 
make in the metric induced by the tmbedding of It. We want to accomplish this 
change by modifying the imbedding. 

(4) GisC’ where38 Sk S ~. 

(5) 0) ts the parameter determining the initial amount of smoothing in our per- 
tubation process. 


CONCLUSIONS: 


If 00 1s taken sufficiently large and if G and its derivatives up to the third order 
are sufficiently small, then the perturbation process will produce a perturbed im- 
bedding of MN that is C* and induces a metric tensor on IN which differs by the amount 
G from the metric induced by the original imbedding. 


Part C: PREPARATORY METRIC APPROXIMATION 


In Theorem 1 of Part B we have the means for making small changes in the 
metric induced by an imbedding. Here in Part C we learn how to arrange that 
only a small change is needed. This solves the imbedding problem for compact 
Riemannian manifolds. 


The addition property 


Suppose a manifold Jt has two imbeddings, one by functions z* into H” and 
the other by functions y° into E”. Let 


02a 02a 

J. = ae Bar: a; and 
We dYp O 

= Ys Ys 

Jy = Dy: ax, Ox; 
be the two metric tensors defined by the two imbeddings, both referred to a 
system 21, %2, °°: 2X, of local coordinates in I. The total set of functions 2; , 
22, °'* 2m, Yi, Y2, °°* Yp defines an imbedding of J into the product space 


E™ X E’. The metric tensor induced on J by this product imbedding is g, + gy . 
This is the addition property of metric tensors. 
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This property enables us to separate into two parts the problem of con- 
structing an imbedding of Xt such that we can successfully apply Theorem 1 
and obtain an isometric imbedding. Suppose g is the intrinsic metric of Jt which 
we want to realize by an imbedding. We first find a z-imbedding which is “‘per- 
turbable’’, that is, one such that the equation system (B8a, b) is everywhere 
non-singular. Then we find a y-imbedding such that g, + g, is close to g. The 
perturbation process is now applied only to the z-imbedding, directed towards 
effecting the change G = g — (g. + g,) in the metric induced by the z-imbedding. 

Actually, the form of Theorem 1 forces us to proceed somewhat like this. 
This theorem tells us that for any perturbable imbedding a sufficiently small 
change G in the metric can be accomplished. To use it we must be able to make 
G small without changing the imbedding which is to be perturbed. And deriva- 
tives of G up to the third order must be small. Thus it is rather important that 
we make G(= g — g. — gy) small by adjusting the y-imbedding, leaving the 
z-imbedding fixed. 

The metric g, must be a positive metric, so it is clear that it would be im- 
possible to use the approach we outlined above unless g — g, were a positive 
metric, which g, would approximate to. If we have a zimbedding that is per- 
turbable (i.e. where (B8a, b) is everywhere a non-singular system) we can 
always simply make a change of scale, if necessary, to make g, as small as desired 
so that g — g, will be positive. This does not affect the quite qualitative question 
of the singularity or non-singularity of (B8a, b). 


A simplified approach 


The method we actually use here for constructing the z and y imbeddings has 
a, certain intricacy occasioned by our desire to bound the number of dimensions 
needed for the final imbedding and to get a relatively good bound. But if one 
throws out all concern for the number of dimensions to be used the problem can 
be handled rather simply. Therefore we indicate here the simpler approach for 
the benefit of those who may not want to bother with the more complicated 
details of our method. 

One could get by with two lemmas: 

[, : Every compact differentiable manifold may be imbedded as an analytic sub- 
manifold of some euclidean space in such a way that (B8a, b) is everywhere non- 
singular. 

L, : Any compact Riemannian manifold with C" positive metric, where3 Sk S ~, 
can be represented as an analytic sub-mantfold of euclidean space so that the induced 
metric and its derivatives up to the third order approximate the given metric as closely 
as desired. 

These lemmas would not give us any indication of the number of dimensions 
necessary. 

L, can be proved very easily by a direct construction. Take an analytic im- 
bedding of Jt in E’”, where n = dim Pt. Let v1, v2, +++ , Ven be the coordinates 
of E’”. Then the 2n” + 3n functions 1, v2, -+* 5 Von j Vi V2, °°") Van j Vive, 
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V3, °**, Vd3, -+: define an imbedding of IM in (2n” + 3n)-space if they are 
used as imbedding functions. This imbedding has the property we want, that 
(B8a, b) is everywhere non-singular. To verify this, consider n of the v’s as 
local coordinates at a point of Mt. [See (C7).] 

Lz could be proved by considering finite dimensional approximations to a C” 
isometric imbedding of Xt in Hilbert space. Or one could use a result of J. 
Schwartz that implies L, . It is relatively easy to patch up a good approximation 
to a metric when there is no limit on the number of imbedding functions to be 
used. Each function might vanish except on a small neighborhood of MN. 


Outline of the method 


Our method for constructing y-imbeddings can use a minimum number of 
dimensions if the z-imbedding is chosen with some care. It turns out that if 
g — gz is close to a favorable metric y the y-imbedding can be made in n’ + 3n 
dimensions. 

Our first step is to find a favorable metric y. Then we construct a 2-imbedding 
such that g — g, approximates y (derivatives are not involved here). Finally we 
construct the y-imbedding so that g, approximates g — g, and its derivatives. 


Determination of y 


The special mechanism we use in constructing the y-imbedding requires a 
set {y¥’} of functions on Jt such that the set of symmetric tensors 


, _ dy’ ay’ 
(C2) Mi; = 307 On; 
has at every point of It a subset of 4n” + 4n tensors that are linearly inde- 
pendent. The zx,’s are local coordinates in Xt. The metric y will be the sum 
over r of the M;;, or in other words, it will be the metric induced by the im- 
bedding defined by using the y’ as imbedding functions. 

Our construction of the y’ is based on a dimensionality argument. If we ar- 
range to deal exclusively with algebraic functions and conditions we can think 
in terms of the precise dimensionality concepts of algebraic geometry (such as 
those based on transcendance degree). Therefore let 2 be an algebraic representa- 
tion” of Nt in E*. The mapping between Nt and % can be as differentiable as the 
differentiability structure on Jt, and analytic if 2 has a global analytic structure 
(not mere overlapping local coordinates analytically related). Then a C* metric 
on Jt becomes a C” metric on W. 

Let us first see how we can find a set of functions which has the independence 


or ‘‘(C2) property” in the ele eor coe of a point of A. Suppose 2%, %2, °*+ , Xn 
are local coordinates. Then the $n? + $n functions 


(C3) f? =2;4+2;, where 2 S J, 


2 Reference [16] proves that there is an algebraic representation of any closed differ- 
entiable manifold. 


This content downloaded from 
76.147.6.157 on Fri, 05 Jan 2024 01:35:50 +00:00 
All use subject to https://about.jstor.org/terms 


IMBEDDING OF RIEMANNIAN MANIFOLDS 55 


can easily be seen to suffice. At any point of 2 some n of the coordinates u , 


Us, ***, Ua of EH* will be suitable as local coordinates. Therefore the $a? + $a 
functions 
(C4) f?? = ug + us, where 8 S 6, 


will have the ‘‘(C2) property”? everywhere. But this is more functions than we 
want to use. 

A plausibility argument for the number of functions y’ that should be neces- 
sary so that there are always in* + 4n linearly independent Mj; at a point 
of % goes as follows: 3n” + 4n functions are good locally, so they would fail on 
a sub-manifold of n — 1 dimensions. Adding a total of n — 1 functions should 
reduce it step by step to a zero-dimensional set of singular points. Then one more 
function should eliminate these. Thus we should need 4n? + 14n functions 
in all. 

1,? + J2n is the correct number, although that argument is not rigorous. 
Define 1)” + 14n functions 

p= 1,2,---,4n° + Lin 


(C5) v= Lis Cas ere | 


as linear combinations of the coordinates of E*. We shall show that a generic 
choice of the coefficients Cs automatically gives a set of y” with the desired prop- 
erty. Let s = 3n” + 4n. Then there are s + n of the y’ and (s + n)a of the 
coefficients Ce. | 

Our dimensionality argument is based on analyzing the family of ways in 
which a set of the y’ can fail to define independent M7;. If the M7; are not 
linearly independent at a point p of % they lie in some linear sub-space H, 
of the space L, of all values of symmetric tensors (with two subscripts) at p. 
We can consider only sub-spaces H, which have one less dimension than the 
whole linear space L, . Since dim L, = s, dim H, = s — 1 and the dimension 
of the family of sub-spaces H, of L, is s — 1. The dimension of the family of 
all H, for all points p of W2isn + 8s — 1. 

For any r the coefficients (Ci, C2, --- , Ca) can be chosen so that y’ is any 
one of the functions i of (C4). Therefore for any particular H, , one can select 
(Ci, C3, -+-, Ca) so that M%,; (which they determine, via y’) does not lie in 
H, . If this were not so the f °° would not have the “‘(C2) property,” but we saw 
that they did have this property. Since not all selections of the constants deter- 
mining ¥’ make M7; lie in H,, the dimension of the family of selections of 
(Ci, C2, ---, Ca that do make Mj; « H, is not more than a — 1. 

The dimension of the family of selections of all the Cg which make all the 
Mj; lie in H, is clearly not more than (s + n)(a — 1) since there are s + n 
sets of a coefficients determining the s + n functions y” which determine the 
M7; . Now since the family of all H, has dimension n + s — 1, the dimension 
of the family of selections of the C% for which there is some H, such that all the 
MM? lie in H, at p can be at most [(s + n)(a — 1)] + [n + 8s — 1]. But this 
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number is (s + n)a — 1, one less than the dimension of all selections of the C4 . 
Consequently with a generic selection of the Cs there will be no point p and 
tensor sub-space H, where all the M7; lie in H,. In other terms, the y’ will 
have the ‘‘(C2)’’, or independence, property we want. 

One can be quite explicit about how the Cg can be chosen. The n-dimensional 
variety associated with 2 (which is the smallest variety containing %) can be 
defined by a set of polynomial equations in the coordinates of EZ”. Adjoin all the 
coefficients involved in these equations to the field of rationals to produce an 
extension F’. If the Cs are algebraically independent over F the y’ will have the 
desired property.’ Obviously this is a sufficient but not necessary condition 
for the proper selection of the C6. 

We have said that the “favorable metric” y would be the sum of the M7, , 
so that 


: ay’ oy’ 
C6 y= m3 i 
(C6) Vis pa 135 = Die ae 
We also said that our procedure was to make g — g, = y. This means g, Y g — 7, 
so we must certainly have g — vy a positive metric. To take care of this let us 
assume that a definite choice of the Cg is made in such a way that they are small 
enough to make g — y be a positive metric. 


The Z-imbedding 


There will be a C’ imbedding of St in E’” which realizes the metric g — y 
exactly [9]. We can think of this as a mapping from % into E’” and approximate 
it in the C’ sense by an algebraic imbedding %. Let g, be the metric induced by 
this imbedding 8. The approximation between g, and g — vy can be as close 
as desired, but it does not extend to derivatives. 

Let v1, v2, °:*, V2, be the coordinates of H’”. At any point of 8 some n of 
these will be suitable as local coordinates; let 21 , x2, --+ , 2n be this subset. 

Now consider the functions 

ei = 2X; 


(C7) i 


Cig = Ui; 


IA 
ae 


If these are the functions z, of (B8a, b) that system takes a very simple form: 


OZa . : 
De a ee = 4; = 0 
LX; 


O’2. : ; ; 
2 = —23y = giz. 
due dx; dx; ~* i 


3 By using rational coefficients in the approximating polynomials used in [16] one would 
obtain an algebraic imbedding YW such that the equations defining the corresponding variety 
would have rational coefficients. Then it would suffice to simply select the Cé as independent 
transcendentals over the field of rationals, without reference to Y. 
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The solution is apparent for any g;;. The system is non-singular in the most 
obvious way. Each linear equation has just one variable (remember the 2Z, 
are the variables) with non-vanishing coefficient and this is a different variable 
in each equation. 

The system (B8a, b) has the property that once one has a set of functions 
Z, that makes the system non-singular then the introduction of new functions 
Za (which also introduces new variables z.) can only improve the situation. 
The coefficients of each variable z, in the equations can be regarded as a vector 
V. with 3n” + 1n + n or 4n’? + 14n components. The system is non-singular 
if there are 3n” + 13n, which is s + n, linearly independent V,’s. Additional 
functions Z, simply introduce more V,’s. 

Define s + 2n (or 3n” + 23n) quadratic functions of the coordinates of E’” by 


B5 
(C8) fa = wee Cc Ve + > ise<s <2n Da UBUs 


where 1 S a S s + 2n. At any point of 8 we can make the z, contain the func- 
tions defined by (C7) which are suitable at that point by an appropriate choice 
of the C’s and the D’s. An argument exactly analogous to the one we used 
above in finding the y’ shows that a generic choice of the C’s and D’s gives 
functions z, that make (B8a, b) non-singular. If the C’s and D’s are algebraically 
independent over the field of definition for 8 they define satisfactory func- 
tions 2. . 

Now how do we arrange that g, & g — y? We select all Do ~ 0 and C4 & 
unless 8 = a, when we select Ca ~ 1. This makes the z-imbedding, which is in 
E*** be approximately the same as %, because all the z* are quite small, except 
for the first 2n, and these are approximately the coordinates 11, ve, --- , Ven 
of E’”. Thus g, & g, and hence g, &% g — y since m XY g — ¥. 


The Y-imbedding 


This is constructed, with the aid of the y’ of (C2), by means of a special 
device. The device will produce an imbedding with metric g, approximating 
any metric of the form 


(C9) g(a, » 42, °°", s4n) = dy aM ;; ) 


where the M i; are those of (C2) and the a, are positive analytic functions on IM. 
(Regard IN as having an analytic structure corresponding to those of its im- 
beddings 2 and %.) This approximation will apply to derivatives also. 


Note that g(1, 1, --- 1) = y. Since the Mj; are linearly independent at each 
point and g — g,  y, we can certainly represent g — g, in the form 
(C10) 9 — gz = De a,M i; 


at any point of St, and we can have | a, — 1| S e where ¢« is a small uniform 
bound that depends on the closeness of the approximation of g — g, toy. 

But can we do this in a uniform manner so that the a, become continuous 
functions on Jt? The solution for the a, of (C10) will be non-unique at every 
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point because there are s + n of the a, and only s components of g — g,, thus 
only s equations. This redundancy can be removed by the same device which 
removed the redundancy of (B8a, b) in Part B. Here we desire positive a, , 
SO we specify: 


(C11) >. (a, — 1)? = minimum, subject to (C10). 


This determines the a, uniquely and makes them have the same differentiability 
as g — g.. Since (C10) has a solution with |a, — 1| S ¢, the solution of the 
modified system (C10, 11) will have | a, — 1| S e€(s + n)}. Assume e¢ is small 
enough so that this makes the a, necessarily positive. 

We define g(a, G2, --- , Onze) by approximating the a, by positive analytic 
functions a,. The a, must also approximate the a, for derivatives up to the 
third order. Hence g — g, must be C’ so that the a, are C*. Since a, 3 a, We 
shall have g(a, d2, --* , Qnis) 39 — G- 


The device 


This device is reminiscent of one used for constructing C’ imbeddings.* 
Suppose A is a large constant. We define 2(s + n), which isn” + 3n, y-imbedding 
functions as 


Yr = — sin (Ay’) 


Yr =a oo COS (xy’). 


(C12) 


When these are used as imbedding functions they induce the metric 


_ SF Yr OY, OG, OF, 
M1 = Lise a de ax; Ox; 


When this is expanded by substituting the formulas for y, and 7, many terms 
cancel. All terms which contain \ ~~ occur as pairs which differ only by containing 
either [sin (Ay’)]-[cos (Ay’)] or [cos (Ay¥’)]-[—sin (Av’)] and cancel together. The 
remaining terms can be combined with the identity sin’ + cos’ = 1. This finally 
gives 


oy’ Oy" | \-2 ~~ A(a,) A(a,) 
2a ee 
2 Bx, 2; ax, dz; 


fy = dir Or Mi; + XG 
— g(a, _* » Asn) + NG. 


Now g is an analytic tensor independent of \. By choosing \ very large the error 
\~*g and any number of its derivatives. can be made as small as desired. So we 
have 


(C14) Fy ~s g(a , G2,°"° Cea) 39 — Jz; 


4 See equation (13) page 387 of reference [9]. 
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where =; indicates approximation up to third derivatives. Thus we have 
Jy + g: ~s g, which is what we need to apply Theorem 1. This requires, of 
course, that g be C’. 


Summary and applications 


We used s + 2n, which is 4n’ + 24n, z-imbedding functions and 2(s + n), 
orn’ + 3n, a obeddine functions. This is 3s + 4n or lin? + 54n functions 
altogether. The z-imbedding was analytic and made (B8a, b) everywhere non- 
singular, so that Theorem 1 could be applied to it. The y-imbedding was also 
analytic and was adjustable so that g, + g, could approximate g as closely as 
desired (this approximation including derivatives up to the third order). For 
this, g had to be C”. 

The z--and y-imbeddings can be arranged to take up arbitrarily little space. 
The z-imbedding approximates a C’ imbedding of 9% which realizes the metric 
g — y. Since a C’ isometric imbedding can be made arbitrarily small (and highly 
twisted) so can the z-imbedding. If the parameter } is very large the y-imbedding 
is very small. 

The fact that the z-imbedding approximates a C’ bedding serves to prevent 
self-intersections in the final imbedding of the manifold in Eur’tin Since the 
amount of perturbation needed can be made arbitrarily small by sa ustne the 
y-imbedding, the application of the perturbation process to the z-imbedding 
need not produce self-intersections. 

Now we state the result obtained by combining the work of Part C with 
Theorem 1 of Part B. This is our “‘main theorem.” 


Imbedding of compact manifolds 


TuEoREM 2. A compact n-manifold with a C” positive metric has a C“ isometric 
imbedding in any small volume of euclidean (n/2)(8n + 11)-space, provided 
3S ks ow. 


Part D: NON-COMPACT MANIFOLDS 


Our treatment here is not a direct attack. It exploits a device by which the 
imbedding problem for non-compact manifolds is reduced to the problem for 
compact manifolds. This approach gives a poor upper bound on the number 
of dimensions needed for the imbedding space; but that is the price of taking a 
shortcut to the non-compact case. 


A special mapping 


Our basic tool isa C” mapping of E” onto then-sphere S”. Most of ZL” is mapped 
into the “north pole” of S”. The interior of the unit disk of #” covers the re- 
maining portion of S” in a one-to-one manner. Any mapping of EF” on S” with 
these properties will serve our purpose, but to illustrate we construct one. 

Take the case of the (x, y) plane, or E”. This can be mapped on the 2-sphere 
P47 + °° = 2 as follows: 
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For x + 7° < 1, let 
Q = exp (2 + y° — 1)~ 


= 
VP+e’t+y 
_ yQ 
"FP FY 
Sea 
2 GH at t+ y?? 
for 2 +y' = 1, =7=0, f= 3 


It is easy to see that £, 7, and ¢ are C” functions because Q isa C~ function if 
it is assigned the value zero for x” + y’ = 1. A direct check verifies that 


Stayt est. 
The equations define a non-singular one-to-one mapping of the open disk 
z’ + y’ < 1 onto the sphere minus the “north pole” (£ = 7 = 0, ¢ = 3). This 


mapping is obtainable by taking a mapping of the open disk onto the whole 
plane, 


z= 2/Q 
= y/Q 

and following this by the classical conformal mapping of the plane onto the 
sphere (minus the ‘“‘north pole’’). The effect is to give a mapping which has a 


C” extension to the rest of the plane where all points not interior to the disk map 
into the “north pole’’. 


| 


Patch mappings 


Consider a C~ Riemannian n-manifold Jt (the metric need not be C™, but 
M has a C™ structure). A local coordinate system or neighborhood N in JR can 
be regarded as the image of the unit disk D of HE” under a C” mapping of D 
into I. This mapping should be one-to-one, non-singular, and extensible to an 
open set of #” containing D, as a C” non-singular mapping. Then an open set 
containing N is mapped on the open set containing D by the C” inverse mapping. 

For brevity call S the n-sphere. Our special mapping will map any open set 
containing D onto S in a C~ manner. This mapping, with the inverse mapping 
N — D, gives a mapping 


~ 
N~S 
which is C” and has a C” extension to an open set containing N. ¢ maps all 


points on the boundary of N or outside N into the “‘north pole” of S. Clearly we 
can extend the definition of ¢ to all points of J? by mapping all other points into 
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this “north pole’”’. ¢ will remain C”. This mapping ¢ has non-vanishing Jacobian 
in the interior of N, so gis C® there. ¢ is called a patch mapping. 


Appropriate coverings for 5) 


Nt can be covered by a family of disk neighborhoods NV, in such a way that we 
can divide the NV; among n + 1 classes where: No two N; of the same class over- 
lap. Each N; overlaps only a finite number of other N;. 

How is such a covering constructed? First obtain a regular star-finite cellular 
sub-division of 9. Then form a disk neighborhood corresponding to each vertex, 
edge, --- , face, or cell of the cellular sub-division. Each disk neighborhood that 
corresponds (for example) to an edge covers the middle section of the edge but 
not the end points. These are covered by the neighborhoods which correspond to 
them in their role of vertices. In this way no two edge neighborhoods are allowed 
to meet. The same principle applies up the series of dimensions. The n + 1 
dimensions from 0 through n give rise to the n + 1 classes of neighborhoods. 

Within each N; we can select a slightly smaller disk neighborhood N,; such 
that the NV; also cover IN. These N; should correspond to sub-disks of D (through 
the mappings between D and the N ;). Then we can select a C™ function wu; for 
each N; which is positive interior to N; and zero on the boundary and outside 
N;. Each u; can be regarded as defined and C™ over all of M. 

Now if we define 


vi = ui/ >i U; 


the v; form a partition of unity by C” functions, each of which is positive interior 
to the corresponding sub-neighborhood N; and zero everywhere else. 


Assignment of metrics 


Each N; has an associated patch mapping 


Yi 
N; 7 S;. 


We write S; to distinguish different n-spheres for different N;. The mapping 
vy; has a C’ non-singular inverse y; on N;. 

Consider a metric yi. on S;. This gives a corresponding metric gio on N;. 
Actually gio can be regarded as defined for all 9)t because it will be zero at the 
boundary of N; and can be extended by defining it as zero on the rest of Mt. 
If it is C* on S; it will be C* on M also. If we select a metric yi on each S; which 
is positive, C”, and sufficiently small the corresponding metrics gio will add on 
Mm to a metric 


Jo = > J x0 
which is C~ and everywhere shorter than g, the metric we are given on Jt, and 


which we want to realize by an imbedding of Yt. For example, each y,» could be 
the metric induced by an imbedding of S; in EZ” as a small geometrical sphere. 
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Now, since g — go is a positive metric, 


gi = go + v:(g — go) 


will be positive and will be as differentiable as g (say C”). g; differs from gi only 

within N,; where the mapping ¢; has a non-singular inverse ¢;'. Therefore g;* 

carries g; — gio over to S; as a C” non-negative metric y; — yi. That is, there is 

a positive C” metric y; on S; which corresponds, via g; , to the metric g; on N;,. 
Consider the sum 


> ji = > Jio + (>; vi)(g — go) 
go + (9 — go) 


Realization of metrics 


We have defined a C” metric y; on each S;. Assuming 3 < k S «, y; can be 
realized by a C* imbedding of S; in B&/”°"*"? by our Theorem 2. We can always 
let the “north pole” of S; be at the origin. 

Now consider all the N; of one of the n + 1 classes, let us say class C. The S; 
corresponding to each has an imbedding in E‘”’”°"*™ that realizes y; and maps 
the ‘‘north pole’’ into the origin. The corresponding patch mappings ¢;, to- 
gether with these imbeddings, define a mapping yc of Mt into Bert”, 
This Wc is C~ and mapsall points of IN, except those interior to any neighborhood 
N; of class C, into the origin. Each of these other points can be in only one N,, 
so the mapping is unambiguously defined. 

This mapping wc induces a metric on Jt which is the sum, 


of the metrics g; associated with neighborhoods N; of class C’. The product map- 
ping 

YHh Kye Kees K Wats 
maps Nt into (n + 1)(n/2)(3n + 11) dimensional space and is also C*. It induces 


the metric 
CG: == 2 Je, 


as we desired. If the image of Jt under y has no self-intersections it is an isometric 
imbedding. In any case it is a C” isometric immersion. 


Avoidance of self-intersections 


Self-intersections can be avoided by using the fact that the isometric imbedding 
of S; can be made as small as desired. Let a; be the minimum distance of a point 
of N; from the origin after it is mapped into pean by ¢; and the imbedding 
of S;. Let 8; be the maximum distance from the origin of points of N; after 
being mapped into Fir’. Think of ¢ as a serial index running 1, 2,3---. 
Now all we need do to avoid self-intersections is to arrange the imbeddings of 
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the S; in order of increasing 7 so that for all 7 


Bi < min Qj. 
f<i 


Why is this sufficient? First, any two points of Jt that are interior to a com- 


mon JN; are distinguished through the imbedding of S; . So we need only consider 
pairs of points which lie in completely different sets of neighborhoods. In this 
case one of the points will be in a sub-neighborhood NV; with lower index than the 
other. Then this point will be further from the origin with respect to the set of 
14n” + 54n coordinates associated with N; than the other point can be. 


The result 


Our theorem is: 
THEOREM 3. Any Riemannian n-manifold with C* positive metric, where 3 


<k < «, has aC isometric imbedding in (1in® + 7n” + 54n)-space, in fact. 
an any small portion of this space. 


i 
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